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Multiuser MIMO Systems 
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Abstract 

A multiplicity of autonomous terminals simultaneously transmits data streams to a compact array of antennas. 
The array uses imperfect channel-state information derived from transmitted pilots to extract the individual data 
streams. The power radiated by the terminals can be made inversely proportional to the square-root of the number 
CsJ I of base station antennas with no reduction in performance. In contrast if perfect channel-state information were 
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available the power could be made inversely proportional to the number of antennas. Lower capacity bounds 
for maximum-ratio combining (MRC), zero-forcing (ZF) and minimum mean-square error (MMSE) detection are 
derived. A MRC receiver normally performs worse than ZF and MMSE. However as power levels are reduced, 
the cross-talk introduced by the inferior maximum-ratio receiver eventually falls below the noise level and this 
j-^ I ', simple receiver becomes a viable option. The tradeoff between the energy efficiency (as measured in bits/J) and 

spectral efficiency (as measured in bits/channel use/terminal) is quantified. It is shown that the use of moderately 
O ■ large antenna arrays can improve the spectral and energy efficiency with orders of magnitude compared to a 

_ single-antenna system. 
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■ I. INTRODUCTION 

In multiuser multiple-input multiple-output (MU-MIMO) systems, a base station (BS) equipped with 
multiple antennas serves a number of users. Such systems have attracted much attention for some time now 
^ ■ Conventionally, the communication between the BS and the users is performed by orthogonalizing 
the channel so that the BS communicates with each user in separate time-frequency resources. This 
is not optimal from an information-theoretic point of view, and higher rates can be achieved if the 
BS communicates with several users in the same time-frequency resource |l3l, BUl. However, complex 
techniques to mitigate inter-user interference must then be used, such as maximum-likelihood multiuser 
detection on the uplink or "dirty-paper coding" on the downlink ||6l, fTl. 

Recently, there has been a great deal of interest in MU-MIMO with very large antenna arrays at the 
BS. Very large arrays can substantially reduce intracell interference with simple signal processing [[8]|. 
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We refer to such systems as "very large MU-MIMO systems" here, and with very large we mean arrays 
comprising say a hundred, or a few hundreds, of antennas, simultaneously serving tens of users. The 
design and analysis of very large MU-MIMO systems is a fairly new subject that is attracting substantial 
interest [[8||-[fTT]|. The vision is that each individual antenna can have a small physical size, and be built 
from inexpensive hardware. With a very large antenna array, things that were random before start to 
look deterministic. As a consequence, the effect of small-scale fading can be averaged out. Furthermore, 
when the number of BS antennas grows large, the random channel vectors between the users and the BS 
become pairwisely orthogonal [[TOll . In the limit of an infinite number of antennas, with simple matched 
filter processing at the BS, uncorrelated noise and intracell interference disappear completely (jH. Another 
important advantage of large MIMO systems is that they enable us to reduce the transmitted power. On the 
uplink, reducing the transmit power of the terminals will drain their batteries slower. On the downlink, 
much of the electrical power consumed by a BS is spent by power amplifiers and associated circuits 
and cooling systems lfT2l . Hence reducing the emitted RF power would help in cutting the electricity 
consumption of the BS. 

This paper analyzes the potential for power savings on the uplink of very large MU-MIMO systems. 
We derive new capacity bounds of the uplink for finite number of BS antennas. These results are different 
from recent results in [[T4| and [fTSl . In [[T4| and ifTSl . the authors derived a deterministic equivalent of the 
SINR assuming that the number of transmit antennas and the number of users go to infinity but their ratio 
remains bounded for the downlink of network MIMO systems using a sophisticated scheduling scheme 
and MISO broadcast channels using zero-forcing (ZF) precoding, respectively. While it is well known that 
MIMO technology can offer improved power efficiency, owing to both array gains and diversity effects 
lfT3l . we are not aware of any work that analyzes power efficiency of MU-MIMO systems with receiver 
structures that are realistic for very large MIMoQ We consider both single-cell and multicell systems, but 
focus on the analysis of single-cell MU-MIMO systems since: i) the results are easily comprehensible; 
ii) it bounds the performance of a multicell system; and iii) the single-cell performance can be actually 
attained if one uses successively less-aggressive frequency-reuse (e.g., with reuse factor 3, or 7). The 
paper makes the following specific contributions: 

• We show that, when the number of BS antennas M grows without bound, we can reduce the 
transmitted power of each user proportionally to 1/M if the BS has perfect channel state information 
(CSI), and proportionally to if CSI is estimated from uplink pilots. This holds true even when 

using simple, linear receivers. We also derive closed-form expressions of lower bounds on the uplink 

' After submitting this work, otlier papers have also addressed the tradeoff between spectral and energy efficiency in MU-MIMO systems. 
An analysis related to the one presented here but for the downlink was given in 1161 . However, the analysis of the downlink is quantitatively 
and qualitatively different both in what concerns systems aspects and the corresponding the capacity bounds. 
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achievable rates for finite M, for the cases of perfect and imperfect CSI, assuming MRC, ZF, and 
minimum mean-squared error (MMSE) receivers, respectively. See Section Hill 
• We study the tradeoff between spectral efficiency and energy efficiency. For imperfect CSI, in the low 
transmit power regime, we can simultaneously increase the spectral-efficiency and energy-efficiency. 
We further show that in large-scale MIMO, very high spectral efficiency can be obtained even with 
simple MRC processing at the same time as the transmit power can be cut back by orders of magnitude 
and that this holds true even when taking into account the losses associated with acquiring CSI from 
uplink pilots. MRC also has the advantage that it can be implemented in a distributed manner, 
i.e., each antenna performs multiplication of the received signals with the conjugate of the channel, 
without sending the entire baseband signal to the BS for processing. See Section |IVl 

II. System Model and Preliminaries 

A. MU-MIMO System Model 

We consider the uplink of a MU-MIMO system. The system includes one BS equipped with an array 
of M antennas that receive data from K single-antenna users. The nice thing about single-antenna users 
is that they are inexpensive, simple, and power-efficient, and each user still gets typically high throughput. 
Furthermore, the assumption that users have single antennas can be considered as a special case of users 
having multiple antennas when we treat the extra antennas as if they were additional autonomous users. 
The users transmit their data in the same time-frequency resource. The M x 1 received vector at the BS 
is 

y = y/p^Gx + n (1) 

where G represents the M x K channel matrix between the BS and the K users, i.e., Qmk — [^Imfc is 
the channel coefficient between the mth antenna of the BS and the kth user; ^yp^x is the K x 1 vector 
of symbols simultaneously transmitted by the K users (the average transmitted power of each user is 
j»u); and n is a vector of additive white, zero-mean Gaussian noise. We take the noise variance to be 
1, to minimize notation, but without loss of generality. With this convention, j»u has the interpretation 
of normalized "transmit" SNR and is therefore dimensionless. The model ([I]) also applies to wideband 
channels handled by OFDM over restricted intervals of frequency. 

The channel matrix G models independent fast fading, geometric attenuation, and log-normal shadow 
fading. The coefficient can be written as 

gmk = h^k\/Wk, m = l,2,...,M (2) 

where hmk is the fast fading coefficient from the fcth user to the mth antenna of the BS. models the 
geometric attenuation and shadow fading which is assumed to be independent over m and to be constant 
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over many coherence time intervals and known a priori. This assumption is reasonable since the distances 
between the users and the BS are much larger than the distance between the antennas, and the value of 
13k changes very slowly with time. Then, we have 

G = HD^'^ (3) 

where H is the MxK matrix of fast fading coefficients between the K users and the BS, i.e., [H]^^i^ = hmk, 
and D is a K X K diagonal matrix, where [-D];,^ = (3k- Therefore, (H]) can be written as 

y = ^HD^/^x + n. (4) 

B. Review of Some Results on Very Long Random Vectors 

We review some limit results for random vectors ITtI that will be useful later on. Let p = [pi ... 
and q = [qi ... be mutually independent n x 1 vectors whose elements are i.i.d. zero-mean random 
variables (RVs) with E | = cTp, and E | = a^, i = 1, ...,n. Then from the law of large numbers, 

1 fj a.s. 2 1 1 ff n 

—p p-^a , and — p g — )■ 0, as n — )■ oo. (5) 

n n 

where denotes the almost sure convergence. Also, from the Lindeberg-Levy central limit theorem, 

^ CM (0, alafj , as n ^ oo (6) 



where A denotes convergence in distribution. 

C. Favorable Propagation 

Throughout the rest of the paper, we assume that the fast fading coefficients, i.e., the elements of H 
are i.i.d. RVs with zero mean and unit variance. Then the conditions in Q-® are satisfied with p and q 
being any two distinct columns of G. In this case we have 

M M 

and we say that we have favorable propagation. Clearly, if all fading coefficients are i.i.d. and zero 
mean, we have favorable propagation. Recent channel measurements campaigns have shown that multiuser 
MIMO systems with large antenna arrays have characteristics that approximate the favorable-propagation 
assumption fairly well [UOll . and therefore provide experimental justification for this assumption. 

To understand why favorable propagation is desirable, consider an M x K uplink (multiple-access) 
MIMO channel H, where M > K, neglecting for now path loss and shadowing factors in D. This 
channel can offer a sum-rate of 

K 

R = J2\0g2{l+PnXl) (7) 
fc=l 
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where is the power spent per terminal and {A;^.}^^ are the singular values of H, see |fT3l . If the 
channel matrix is normalized such that \Hij\ ~ 1 (where ~ means equality of the order of magnitude), 
then Ylk=i — II-^IP ~ MK. Under this constraint the rate R is bounded as 



The lower bound (left inequality) is satisfied with equality if = MK and A^ = ■ ■ ■ = A|; = 
and corresponds to a rank-one (line-of- sight) channel. The upper bound (right inequality) is achieved if 
Af = ■ ■ ■ = A|- = M. This occurs if the columns of H are mutually orthogonal and have the same norm, 
which is the case when we have favorable propagation. 



By using a large antenna array, we can reduce the transmitted power of the users as M grows large, 
while maintaining a given, desired quality-of-service. In this section, we quantify this potential for power 
decrease, and derive achievable rates of the uplink. Theoretically, the BS can use the maximum-likelihood 
detector to obtain optimal performance. However, the complexity of this detector grows exponentially 
with K. The interesting operating regime is when both M and K are large, but M is still (much) larger 
than K, i.e., 1 <^ K <^ M. It is known that in this case, linear detectors (MRC, ZF and MMSE) perform 
fairly well ^ and therefore we will restrict consideration to those detectors in this paper. We treat the 
cases of perfect CSI (Section IIII-A|) and estimated CSI (Section IIII-B|) separately. 
A. Perfect Channel State Information 

We first consider the case when the BS has perfect CSI, i.e. it knows G. Let AhQ an M x K linear 
detector matrix which depends on the channel G. By using the linear detector, the received signal is 
separated into streams by multiplying it with as follows 



log2 (1 + MKp^) <R< Klog^ (1 + Mp^) 



(8) 



III. Achievable Rate and Asymptotic (M -)> oo) Power Efficiency 



r = A^y. 



(9) 



We consider three conventional linear detectors MRC, ZF, and MMSE, i.e.. 



G 



for MRC 



A = I G (G^'G 




for ZF 



for MMSE 



(10) 



From ([T]) and the received vector after using the linear detector is given by 




(11) 



Let Tfc and Xk be the kth elements of the i^T x 1 vectors r and x, respectively. Then, 

K 



(12) 



i=l,i^k 
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where ak and are the fcth columns of the matrices A and G, respectively. For a fixed channel realization 
G, the noise-plus-interference term is a random variable with zero mean and variance Yld=i i^k + 
||afcp. By modeling this term as additive Gaussian noise independent of Xk we can obtain a lower bound 
on the achievable rate. Assuming further that the channel is ergodic so that each codeword spans over a 
large (infinite) number of realizations of the fast-fading factor of G, the ergodic achievable uplink rate of 
the kth user is 



i?P,fe = E log, 1+ V^^rL . „ „J (13) 





ak9k\ 


2 




K9^ 


I' + hkF 



To approach this capacity lower bound, the message has to be encoded over many realizations of all sources 
of randomness that enter the model (noise and channel). In practice, assuming wideband operation, this 
can be achieved by coding over the frequency domain, using, for example coded OFDM. 

Proposition 1: Assume that the BS has perfect CSI and that the transmit power of each user is scaled 
with M according to p^ = where is fixed. Thenj^ 

Rp^k ^ log2 (1 + /3fc^u) , M ^ oo. (14) 

Proof: We give the proof for the case of an MRC receiver. With MRC, A = G so ak = Qk- From 
(fT3] ), the achievable uplink rate of the kih. user is 

I V P^^^^=lMk\9k9^? + \\9k\\y ) 

Substituting Pu = ^ into (fT5l) . and using ([5]), we obtain (fT4l) . By using the law of large numbers, we 
can arrive at the same result for the ZF and MMSE receivers. Note from ^ and ([5]) that when M grows 
large, jjG^G tends to D, and hence the ZF and MMSE filters tend to that of the MRC. ■ 

Proposition [H shows that with perfect CSI at the BS and a large M, the performance of a MU-MIMO 
system with M antennas at the BS and a transmit power per user of E^jM is equal to the performance of 
a SISO system with transmit power E^, without any intra-cell interference and without any fast fading. In 
other words, by using a large number of BS antennas, we can scale down the transmit power proportionally 
to 1/Af. At the same time we increase the spectral efficiency K times by simultaneously serving K users 
in the same time-frequency resource. 

1) Maximum-Ratio Combining: For MRC, from (fT5l) . by the convexity of log, (l + ^) and using 
Jensen's inequality, we obtain the following lower bound on the achievable rate: 

> ^p!^ = iog2 (^i + [^[ '^^^"'^^p!^^^^^ (16) 

^ As mentioned after Q, pu has the interpretation of normalized transmit SNR, and it is dimensionless. Therefore Eu is dimensionless 
too. 
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Proposition 2: With perfect CSI, Rayleigh fading, and M > 2, the uplink achievable rate from the k\h 
user for MRC can be lower bounded as follows: 



RTi. = Ids, 1 + 



p„ (M-l)/3fc 



Proof: See Appendix lAl 
If = E^/M, and M grows without bound, then from (flTI) . we have 



(17) 



R^l = log, 1 + 



t(M-l)/3fe 

M Z^i=l,i^k K« -L, 



^ log2 (1 + f3kE^) , M ^ oo 



(18) 



Equation (fTSl ) shows that the lower bound in (fTT] ) becomes equal to the exact limit in Proposition \T\ as 
M ^ oo. 

2) Zero-Forcing Receiver: With ZF, = {G^G)~^G^, or = /a-. Therefore, a^^p^ = 5ki, 

where 5ki = 1 when k = i and otherwise. From (fT3]) , the uplink rate for the /cth user is 



Rfj, = E < log, I 1 



(19) 



J fefc' 



By using Jensen's inequality, we obtain the following lower bound on the achievable rate: 



Rtk > Kk = l0g2 1 



Pn 



E 



kk 



(20) 



Proposition 3: When using ZF, in Rayleigh fading, and provided that M > K + 1, the achievable 
uplink rate for the kth user is lower bounded by 



Rf, = \og,il+p^{M-K)P,) 



(21) 



Proof: See Appendix |Bl 
If = E^/M, and M grows large, we have 



Rf,k = ^og,(l + j^iM-K)P,] ^ log2 (1 + , M^oo 



(22) 



We can see again from (1221) that the lower bound becomes exact for large M. 

3) Minimum Mean-Squared Error Receiver: For MMSE, the detector matrix A is 



G"G + —Ik 

Pn 



-1 



= G^ { GG 



Pn 



■M 



-1 



(23) 



Therefore, the kth column of A is given by [[T8 



ttk = ( GG" + —Im) = H^^i^^ I 1 

Pn J g^Ak 9k + 1 



(24) 
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where = J2i=i i^kdidf + ^-^m- Substituting (|24l) into (fT3l) . we obtain the uplink rate for user k: 

( . \ 



i?^r = E {log2 (1 + A, ^p,) } ^= E <! log, 



E <^ log2 



f ^7m + GG 



H 



-1 



(b) 



E <^ log. 



J kk 



[iK+VnG^'GY^ 



(25) 



kk' 



where (a) is obtained directly from (l24l) . and (6) is obtained by using the identity 

-1 /I \ -1 



G" ( —7m + GG^ 



G 



Ja- + G^G G^G = - (7i^ + PuG^G) 



By using Jensen's inequality, we obtain the following lower bound on the achievable uplink rate: 



1 



E 



7fc 



(26) 



where jk 



1. For Rayleigh fading, the exact distribution of 7^ can be found in 



19|. This distribution is analytically intractable. To proceed, we approximate it with a distribution which 



has an analytically tractable form. More specifically, the PDF of 7^ can be approximated by a Gamma 
distribution as follows [|20l: 



where 



P^k (7) 



{M - K +1 + {K -1) fxf 



7' 



M-7r + i + (i^-i)K 



M -K + 1 + {K -1)k ' M-K + l + {K-l)i^ 

and where /i and k, are determined by solving following equations: 

1 



(27) 



(28) 



K-l ^ 

i=l,ij^k 



.J^k (1 



M 



K 1+ 5^ 



(MpuA (1 - 



K-\ _|_ A'-l 



PuA/i + 1 



M ■ M + 1) / (^Pu A (1 - ^ + ^/^) + 1) ' 

Using the approximate PDF of 7^ given by (1271 ). we have the following proposition. 



(29) 



Proposition 4: With perfect CSI, Rayleigh fading, and MMSE, the lower bound on the achievable rate 
for the A;th user can be approximated as 



Rv.k 



log, (1 + {ak - 1) Qk) 



Proof: Substituting (|27]) into (|26l) . and using the identity |I2T1 eq. (3.326.2)], we obtain 

r(a;fc) 



i?^r = log, 1 



r {ak - 1 



-Qk 



(30) 



(31) 
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where T (■) is the Gamma function. Then, using T {x + 1) = xT (x), we obtain the desired result (|30l ). ■ 
Remark 1: From (fT3l) . the achievable rate Rp^k can be rewritten as 

The inequality is obtained by using Cauchy-Schwarz' inequality, which holds with equality when = 
cK'^^Qf,, for any c G C. This corresponds to the MMSE detector (see (l24l)). This implies that the MMSE 
detector is optimal in the sense that it maximizes the achievable rate given by (fT3l ). 



B. Imperfect Channel State Information 

In practice, the channel matrix G has to be estimated at the BS. The standard way of doing this is to use 
uplink pilots. A part of the coherence interval of the channel is then used for the uplink training. Let T be 
the length (time-bandwidth product) of the coherence interval and let r be the number of symbols used 
for pilots. During the training part of the coherence interval, all users simultaneously transmit mutually 
orthogonal pilot sequences of length r symbols. The pilot sequences used by the K users can be represented 
by a r X A' matrix y/p^^ (r > K), which satisfies = 7^^, where = rp^- Then, the M x r received 
pilot matrix at the BS is given by 

Fp = ^G$^ + iV (33) 
where iV is an M x r matrix with i.i.d. CN (0, 1) elements. The MMSE estimate of G given Y is 

G = -^Yp^*b = (g + -^W) b (34) 



where W = iV$*, and D = {j^D^^ + 7^') ^ Since = Ik, W has i.i.d. CU (0, 1) elements. Note 
that our analysis takes into account the fact that pilot signals cannot take advantage of the large number 
of receive antennas since channel estimation has to be done on a per-receive antenna basis. All results 
that we present take this fact into account. Denote hy £ = G — G. Then, from (|34|) . the elements of the 
zth column of £ are RVs with zero means and variances ^ Furthermore, owing to the properties of 
MMSE estimation, £ is independent of G. The received vector at the BS can be rewritten as 

JJ^Gx — Jp^£x + n \ . (35) 



Therefore, after using the linear detector, the received signal associated with the kth user is 

K K 

rk = v^af - ^a^£x + a^n = ^/Ko-kh^k + ^ a^Oi^i - af e^Xi + af n (36) 

i=l,iy^k i=l 

where dk, Qi, and Si are the zth columns of A, G, and £, respectively. 
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Since G and £ are independent, A and £ are independent too. The BS treats the channel estimate as the 
true channel, and the part including the last three terms of (|36l) is considered as interference and noise. 
Therefore, an achievable rate of the uplink transmission from the A;th user is given by 

i?iP,. = E <; log, I 1 + .^"'5^!Lx . I ) (37) 




Intuitively, if we cut the transmitted power of each user, both the data signal and the pilot signal suffer 
from the reduction in power. Since these signals are multiplied together at the receiver, we expect that 
there will be a "squaring effect". As a consequence, we cannot reduce power proportionally to 1/M as 
in the case of perfect CSI. The following proposition shows that it is possible to reduce the power (only) 
proportionally to I/a/M. 

Proposition 5: Assume that the BS has imperfect CSI, obtained by MMSE estimation from uplink 
pilots, and that the transmit power of each user is = -j^, where is fixed. Then, 



Rip,k ^ log2 (1 + r(3lEl) ,M^oo. (38) 
Proof: For MRC, substituting = Qf. into (l37l) . we obtain the achievable uplink rate as 

R'^P,, = E (log2 f 1 + r^- i^/i^^'ts-^ ft p i I (39) 

I V PnE^=l,^^k\9k9^\^+Pu\\9k\\^Ei=l7^;;$-^l + \\9k\\y ) 

Substituting = E^/ \fM into (|39| ). and again using ^ along with the fact that each element of g^, is 
a RV with zero mean and variance -^^3%t> we obtain (l38l) . We can obtain the limit in (l38l) for ZF and 
MMSE in a similar way. ■ 

Proposition [5] implies that with imperfect CSI and a large M, the performance of a MU-MIMO system 
with an M-antenna array at the BS and with the transmit power per user set to E^/\fM is equal to the 
performance of an interference-free SISO link with transmit power r/dkE"^, without fast fading. 

Remark 2: From the proof of Proposition |5l we see that if we cut the transmit power proportionally 
to l/M"", where a > 1/2, then the SINK of the uplink transmission from the kth user will go to zero as 
M — !■ oo. This means that 1/\/M is the fastest rate at which we can cut the transmit power of each user 
and still maintain a fixed rate. 

Remark 3: In general, each user can use different transmit powers which depend on the geometric 
attenuation and the shadow fading. This can be done by assuming that the kth user knows /3k and performs 
power control. In this case, the reasoning leading to Proposition [5] can be extended to show that to achieve 
the same rate as in a SISO system using transmit power E^^, we must choose the transmit power of the 
kth user to be J^^. 
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Remark 4: It can be seen directly from (fT5l ) and (|39l) that the power-scaling laws still hold even for the 
most unfavorable propagation case (where H has rank one). However, for this case, the multiplexing gains 
do not materialize since the intracell interference cannot be cancelled when M grows without bound. 

1 ) Maximum-Ratio Combining: By following a similar line of reasoning as in the case of perfect CSI, 
we can obtain lower bounds on the achievable rate. 

Proposition 6: With imperfect CSI, Rayleigh fading, MRC processing, and for M > 2, the achievable 
uplink rate for the kth user is lower bounded by 



pmrc 



log2 



Pn {rp^f^k + 1) Ei=i,i^fc A + (r + 1) p^f3k + 1 _ 



(40) 



By choosing = E^/ y M, we obtain 



Ri^% ^ log, {l + T(3lEl), M^oo 



(41) 



Again, when M oo, the asymptotic bound on the rate equals the exact limit obtained from Proposition[5l 
2) ZF Receiver: For the ZF receiver, we have a^Qi = Ski- From (1371 ), we obtain the achievable uplink 
rate for the A;th user as 



R^' = E { log. 



( 



\ 



Pn 



T 



kk 



(42) 



Following the same derivations as in Section IHI- A2I for the case of perfect CSI, we obtain the following 
lower bound on the achievable uplink rate. 

Proposition 7: With ZF processing using imperfect CSI, Rayleigh fading, and for M > K + 1, the 
achievable uplink rate for the kth user is bounded as 



rpl (M - K) (31 



(rPu/3fc + l)Er=i 



TPuft + 1 



(43) 



Similarly, with = Eyj^/yM, when A/ — oo, the achievable uplink rate and its lower bound tend to 
the ones for MRC (see dS])), i.e.. 



^ik ^ log2 (1 + rPlEl) , M ^ oo 



(44) 



which equals the rate value obtained from Proposition [51 
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3) MMSE Receiver: With imperfect CSI, the received vector at the BS can be rewritten as 



y 



(45) 



Therefore, for the MMSE receiver, the A;th column of A is given by 



GG" + —Gov I 

Pn 



fv'J^x + n) 



K 9k 



9k K 9k + 1 

where Gov (a) denotes the covariance matrix of a random vector a, and 



(46) 



— H Mm 

1 Pu 



(47) 



Similarly to in Remark [H by using Cauchy-Schwarz' inequality, we can show that the MMSE receiver 
given by (|46|) is the optimal detector in the sense that it maximizes the rate given by (l37l) . 

Substituting (|46|) into (|37l) . we get the achievable uplink rate for the fcth user with MMSE receivers as 

1 



R 



P,k 



E < 



log2 



v 



K ft 



TPuft + 1 Pu 



-1 



>■ (48) 



kk 



J] 



Again, using an approximate distribution for the SINR, we can obtain a lower bound on the achievable 
uplink rate in closed form. 

Proposition 8: With imperfect CSI and Rayleigh fading, the achievable rate for the A;th user with MMSE 
processing is approximately lower bounded as follows: 



R'^'t = hgJl + {ak-l)9k 



where 



{M-K+1 + {K -l)iiy 
M-K + l + {K-l)k 



M-K + l + {K-l)k 



uj(3k 



(49) 



(50) 



where uj = ( Ylf=i 



TPuft + l Pu 



~ Tpu&H-l 



fi and k are obtained by using following equations: 



fx 



K 



K 



1 ^ 



1 



i=l,i^k 



Mu(3i{l- 



K-l j_ K-1 
M 



K 

+ Yl 7 



K 



uo(5ifi + 1 



+k Mo; A (1 



A) +1 



(51) 



Table |I] summarizes the lower bounds on the achievable rates for linear receivers derived in this section, 
distinguishing between the cases of perfect and imperfect CSI, respectively. 
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We have considered a single-cell MU-MIMO system. This simplifies the analysis, and it gives us 
important insights into how power can be scaled with the number of antennas in very large MIMO 
systems. A natural question is to what extent this power-scaling law still holds for multicell MU-MIMO 
systems. Intuitively, when we reduce the transmit power of each user, the effect of interference from 
other cells also reduces and hence, the SINR will stay unchanged. Therefore we will have the same 
power-scaling law as in the single-cell scenario. The next section explains this argument in more detail. 

C. Power-Scaling Law for Multicell MU-MIMO Systems 

We will use the MRC for our analysis. A similar analysis can be performed for the ZF and MMSE 
detectors. Consider the uplink of a multicell MU-MIMO system with L cells sharing the same frequency 
band. Each cell includes one BS equipped with M antennas and K single- antenna users. The M x 1 
received vector at the /th BS is given by 

L 

Vi = VK ^ GiiXi + ni (52) 

i=l 

where ^Jp^Xi is the K x 1 transmitted vector of K users in the zth cell; ni is an AWGN vector, ni ~ 
CJ\f {0,Im)', and Gu is the M x K channel matrix between the /th BS and the K users in the ith cell. 
The channel matrix Gu can be represented as 

Gh = HuD]i^ (53) 

where Hu is the fast fading matrix between the /th BS and the K users in the ith cell whose elements 
have zero mean and unit variance; and Du h K x K diagonal matrix, where [Dii\j^f^ = (3iik, with (3iik 
represents the large-scale fading between the kth user in the i cell and the /th BS. 

1 ) Perfect CSl: With perfect CSI, the received signal at the /th BS after using MRC is given by 

L 

ri = Gf,yi = y^G^GuXi + Yl ^nGuX, + G^^m (54) 
With = ff, (l54l ) can be rewritten as 

1 r, ^ VeA, + ^ X: + 4,0- (55) 



1=1,1^1 

From Q-®, when M grows large, the interference from other cells disappears. More precisely, 

^ Ti ^ ^,DuXi + D]/\ (56) 



where fii ~ CJ\f (0,1). Therefore, the SINR of the uplink transmission from the kth user in the /th cell 
converges to a constant value when M grows large, more precisely 

SINRP, ^ A/fcK, as M ^ oo (57) 

This means that the power scaling law derived for single-cell systems is valid in multicell systems too. 
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2) Imperfect CSI: In this case, the channel estimate from the uplink pilots is contaminated by inter- 
ference from other cells. The MMSE channel estimate of the channel matrix Gu is given by [fTTI 



Gu= \ Y,Gi 



,i=l 



Wi Da 



where Da is a diagonal matrix where the A;th diagonal element 
received signal at the Ith BS after using MRC is given by 



D, 



Ikk \ 



ri 



GaVi = Du Gi, + -^=^^ Yl + 



(58) 



The 



(59) 



With pu = EJ\/M, we have 



^ Dan 



M ^ Z^ A/f 3/4 + , /7 



^ W','Gn 



i=l j=l j=l 

By using ([5]) and as M grows large, we obtain 



i=l 



1 Wfni 



(60) 



1 ^-1. /7— 

i=l 



(61) 



where ~ CA/" (0, Jm)- Therefore, the asymptotic SINR of the uplink from the kth user in the Zth cell is 



as M — )■ oo. 



(62) 



r 



We can see that the power-scaling law still holds. Furthermore, transmission from users in other 

cells constitutes residual interference. The reason is that the pilot reuse gives pilot-contamination-induced 
inter-cell interference which grows with M at the same rate as the desired signal. 



IV. Energy-Efficiency versus Spectral-Efficiency Tradeoff 

The energy-efficiency (in bits/Joule) of a system is defined as the spectral-efficiency (sum-rate in 
bits/channel use) divided by the transmit power expended (in Joules/channel use). Typically, increasing 
the spectral efficiency is associated with increasing the power and hence, with decreasing the energy- 
efficiency. Therefore, there is a fundamental tradeoff between the energy efficiency and the spectral 
efficiency. However, in one operating regime it is possible to jointly increase the energy and spectral 
efficiencies, and in this regime there is no tradeoff. This may appear a bit counterintuitive at first, but it 
falls out from the analysis in Section |IV-A[ Note, however, that this effect occurs in an operating regime 
that is probably of less interest in practice. 

In this section, we study the energy- spectral efficiency tradeoff for the uplink of MU-MIMO systems 
using linear receivers at the BS. Certain activities (multiplexing to many users rather than beamforming 
to a single user and increasing the number of service antennas) can simultaneously benefit both the 
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spectral-efficiency and the radiated energy-efficiency. Once the number of service antennas is set, one can 
adjust other system parameters (radiated power, numbers of users, duration of pilot sequences) to obtain 
increased spectral-efficiency at the cost of reduced energy-efficiency, and vice-versa. This should be a 
desirable feature for service providers: they can set the operating point according to the current traffic 
demand (high energy-efficiency and low spectral-efficiency, for example, during periods of low demand). 

A. Single-Cell MU-MIMO Systems 

We define the spectral efficiency for perfect and imperfect CSI, respectively, as follows 

^ ~ T -T ^ ~ 

Rp =^ Rp^k, and Rip = ^ ^ Rip i, (63) 

fc=l k=l 

where A E {mrc, zf , mmse} corresponds to MRC, ZF and MMSE, and T is the coherence interval in 
symbols. The energy-efficiency for perfect and imperfect CSI is defined as 

rj^ = Lr^^ and r]fp = —Ri> (64) 

Pn Pn 

For analytical tractability, we ignore the effect of the large-scale fading here, i.e., we set D = I^. Also, 
we only consider MRC and ZF receivers 1^ 

For perfect CSI, it is straightforward to show from (flTl) . (|2TI) . and (|64l) that when the spectral efficiency 
increases, the energy efficiency decreases. For imperfect CSI, this is not always so, as we shall see next. 
In what follows, we focus on the case of imperfect CSI since this is the case of interest in practice. 

1) Maximum-Ratio Combining: From (|40|) . the spectral efficiency and energy efficiency with MRC 
processing are given by 



T-T ( riM-Dv^ \ 1 

R^r = —^K log, 1 + n 2^rA-l\ ^1 ' ^ip"^ = -^ip' (65) 

T V r(A - + (A +r)pu + 1/ Pu 



We have 



lim ,^p- = lim li?^- = lim ^i^^^M4^%^% = (66) 

Pu-^o pu^o pu^o T t{K-1)pI + {K + r)p^ + l ^ ^ 



and 



Hm T]^^^ = lim —R'^^' = (67) 

Equations (l66l) and (l67l) imply that for low the energy efficiency increases when increases, and 
for high the energy efficiency decreases when increases. Since > 0, Vj>u > 0, R'lp'^ is a 

monotonically increasing function of pu. Therefore, at low (and hence at low spectral efficiency), the 

^ When M is large, the performance of the MMSE receiver is very close to that of the ZF receiver (see Section|V]l. Therefore, the insights 
on energy versus spectral efficiency obtained from studying the performance of ZF can be used to draw conclusions about MMSE as well. 
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energy efficiency increases as the spectral efficiency increases and vice versa at high The reason is 
that, the spectral efficiency suffers from a "squaring effect" when the received data signal is multiplied 
with the received pilots. Hence, at <^ 1, the spectral-efficiency behaves as ~ p\. As a consequence, 
the energy efficiency (which is defined as the spectral efficiency divided by pu) increases linearly with 
Pu. In more detail, expanding the rate in a Taylor series for p^^ <^\, we obtain 



J5 pmrc 



LIP ~ JLjp Ip^^o 



dpu 



1 d'^Rf^'' 



+ 2 " dpi 



pI = ^7f7-K log, (e) T{M-l)pi (68) 



Pu = 

This gives the following relation between the spectral efficiency and energy efficiency at p^ <^ 1: 



Vtp' = \j \og, (e) r (M - 1) i?-- (69) 

We can see that when p^ <^ 1, hy doubling the spectral efficiency, or by doubling M, we can increase 
the energy efficiency by 1.5 dB. 

2) Zero-Forcing Receiver: From (l43l) . the spectral efficiency and energy efficiency for ZF are given by 



T \ (A + r) + 1 / Pn 

Similarly to in the analysis of MRC, we can show that at low transmit power p^, the energy efficiency 
increases when the spectral efficiency increases. In the low-pu regime, we obtain the following Taylor 
series expansion 



B^^ ^ -^K log2 (e) r (M - K) pi for < 1 (71) 



Therefore, 



r/Fl = J log, {e)T{M-K) Rf^ (72) 



Again, at <ti 1, by doubling M or i?jp, we can increase the energy efficiency by 1.5 dB. 

B. Multicell MU-MIMO Systems 

In this section, we derive expressions for the energy-efficiency and spectral-efficiency for a multicell 
system. These are used for the simulation in the Section |Vl Here, we consider a simplified channel model, 
i.e.. Da = Ik, and Du = (31 k, where (3 E [0, 1] is an intercell interference factor. Note that from (l58l) . 
the estimate of the channel between the A;th user in the /th cell and the /th BS is given by 

Qiik = ({L-l)(3 + l + -] (huk + V ^/Phuk + -^wik] (73) 

V PpJ \ i^, Vp^ J 

The term Xl/^fe ^fP^uk represents the pilot contamination, therefore ^'^'e{|I!,^2}'''^^ ^ = P {L — 1) can be 
considered as the effect of pilot contamination. 
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Following a similar derivation as in the case of single-cell MU-MIMO systems, we obtain the spectral 
efficiency and energy efficiency for imperfect CSI with MRC and ZF receivers, respectively, as follows: 

T-r / r(M-l)p2 \ i 
R^^ = —-K logo 1 + 7—\-, -, ^ , and Cui = — ^ip' (74) 

R-^ = Ik logo I 1 + v^- = t{M- K) pI ^ \ ^ ]_ 

"'"^ T rK{L^-Lf3 + f3-l)pl + L{K + T)p^ + lJ' ""^ Pu 

where L = (L — + The principal complexity in the derivation is the correlation between pilot- 
contaminated channel estimates. 

We can see that the spectral efficiency is a decreasing function of /3 and L. Furthermore, when L = 1, 
or /3 = 0, the resuks and (|75l) coincide with ([651) and dVO]) for single-cell MU-MIMO systems. 

V. Numerical Results 

A. Single-Cell MU-MIMO Systems 

We consider a hexagonal cell with a radius (from center to vertex) of 1000 meters. The users are located 
uniformly at random in the cell and we assume that no user is closer to the BS than rh = 100 meters. 
The large-scale fading is modelled via (3^ = z^/ {rk/rY^" , where is a log-normal random variable with 
standard deviation cTshadow. "^k is the distance between the fcth user and the BS, and v is the path loss 
exponent. For all examples, we choose ^shadow = 8 dB, and u = 3.8. 

We assume that the transmitted data are modulated with OFDM. Here, we choose parameters that 
resemble those of LTE standard: an OFDM symbol duration of Tg = 71.4yus, and a useful symbol duration 
of Tu = 66.7/iS. Therefore, the guard interval length is Tg = Tg — = 4.7/iS. We choose the channel 
coherence time to be Tc = 1 ms. Then, T = = 196, where ^ = 14 is the number of OFDM symbols 

Is J-g J-s 

in a 1 ms coherence interval, and = 14 corresponds to the "frequency smoothness interval" |l8l. 

1 ) Power-Scaling Law: We first conduct an experiment to validate the tightness of our proposed capacity 
bounds. Fig. [T] shows the simulated spectral efficiency and the proposed analytical bounds for MRC, ZF, 
and MMSE receivers with perfect and imperfect CSI at ]?u = 10 dB. In this example there are K = 10 
users. For CSI estimation from uplink pilots, we choose pilot sequences of length r = K. (This is the 
smallest amount of training that can be used.) Clearly, all bounds are very tight, especially at large M. 
Therefore, in the following, we will use these bounds for all numerical work. 

We next illustrate the power scaling laws. Fig. [21 shows the spectral efficiency on the uplink versus 
the number of BS antennas for p^ = E^/M and p^ = E^/ \/M with perfect and imperfect receiver CSI, 
and with MRC, ZF, and MMSE processing, respectively. Here, we choose E^^ = 20 dB. At this SNR, 
the spectral efficiency is in the order of 10-30 bits/s/Hz, corresponding to a spectral efficiency per user 
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of 1-3 bits/s/Hz. These operating points are reasonable from a practical point of view. For example, 64- 
QAM with a rate- 1/2 channel code would correspond to 3 bits/s/Hz. (Figure [3l see below, shows results 
at lower SNR.) As expected, with pu = E^/M, when M increases, the spectral efficiency approaches a 
constant value for the case of perfect CSI, but decreases to for the case of imperfect CSI. However, with 
Pu = Eu/\/M, for the case of perfect CSI the spectral efficiency grows without bound (logarithmically 
fast with M) when Af — )■ oo and with imperfect CSI, the spectral efficiency converges to a nonzero limit 
as M — oo. These results confirm that we can scale down the transmitted power of each user as E^/M 
for the perfect CSI case, and as Eu/^/M for the imperfect CSI case when M is large. 

Typically ZF is better than MRC at high SNR, and vice versa at low SNR |[13l. MMSE always performs 
the best across the entire SNR range (see Remark [1]). When comparing MRC and ZF in Fig. [2l we see 
that here, when the transmitted power is proportional to 1/v^M, the power is not low enough to make 
MRC perform as well as ZF. But when the transmitted power is proportional to 1/M, MRC performs 
almost as well as ZF for large M. Furthermore, as we can see from the figure, MMSE is always better 
than MRC or ZF, and its performance is very close to ZF. 

In Fig. [3l we consider the same setting as in Fig. |2l but we choose -Eu = 5 dB. This figure provides the 
same insights as Fig. [21 The gap between the performance of MRC and that of ZF (or MMSE) is reduced 
compared with Fig. [21 This is so because the relative effect of crosstalk interference (the interference from 
other users) as compared to the thermal noise is smaller here than in Fig. [2l 

We next show the transmit power per user that is needed to reach a fixed spectral efficiency. Fig. [H shows 
the normalized power (pu) required to achieve 1 bit/s/Hz per user as a function of M. As predicted by 
the analysis, by doubling M, we can cut back the power by approximately 3 dB and 1.5 dB for the cases 
of perfect and imperfect CSI, respectively. When M is large (M/K ^ 6), the difference in performance 
between MRC and ZF (or MMSE) is less than 1 dB and 3 dB for the cases of perfect and imperfect 
CSI, respectively. This difference increases when we increase the target spectral efficiency. Fig. [5l shows 
the normalized power required for 2 bit/s/Hz per user. Here, the crosstalk interference is more significant 
(relative to the thermal noise) and hence the ZF and MMSE receivers perform relatively better. 

2) Energy Efficiency versus Spectral Efficiency Tradeoff: We next examine the tradeoff between energy 
efficiency and spectral efficiency in more detail. Here, we ignore the effect of large-scale fading, i.e., we set 
D = Ik- We normalize the energy efficiency against a reference mode corresponding to a single- antenna 
BS serving one single- antenna user with p^ = 10 dB. For this reference mode, the spectral efficiencies 
and energy efficiencies for MRC, ZF, and MMSE are equal, and given by (from ([39l) and (|63l) ) 
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where z is a. Gaussian RV with zero mean and unit variance. For the reference mode, the spectral-efficiency 
is obtained by choosing the duration of the uplink pilot sequence r to maximize R^p. Numerically we 
find that R^p = 2.65 bits/s/Hz and r/fp = 0.265 bits/J. 

Fig. [6] shows the relative energy efficiency versus the the spectral efficiency for MRC and ZF. The 
relative energy efficiency is obtained by normalizing the energy efficiency by r/^p and it is therefore 
dimensionless. The dotted and dashed lines show the performances for the cases of M = 1,K = 1 and 
M = 100, K = 1, respectively. Each point on the curves is obtained by choosing the transmit power 
and pilot sequence length r to maximize the energy efficiency for a given spectral efficiency. The solid 
lines show the performance for the cases of M = 50, and 100. Each point on these curves is computed by 
jointly choosing K, r, and to maximize the energy-efficiency subject a fixed spectral-efficiency, i.e., 

arg max rjfp, s.t. i?fp = const., K <t <T 

Pu,K,T 

We first consider a single-user system with K = 1. We compare the performance of the cases M = 1 
and M = 100. Since K = 1 the performances of MRC and ZF are equal. With the same power used 
as in the reference mode, i.e., pu = 10 dB, using 100 antennas can increase the spectral efficiency and 
the energy efficiency by factors of 4 and 3, respectively. Reducing the transmit power by a factor of 100, 
from 10 dB to —10 dB yields a 100-fold improvement in energy efficiency compared with that of the 
reference mode with no reduction in spectral-efficiency. 

We next consider a multiuser system (K > 1). Here the transmit power p^, the number of users K, and 
the duration of pilot sequences r are chosen optimally for fixed M. We consider M = 50 and 100. Here 
the system performance improves very significantly compared to the single-user case. For example, with 
MRC, at = dB, compared with the case of M = 1, A' = 1, the spectral-efficiency increases by factors 
of 50 and 80, while the energy-efficiency increases by factors of 55 and 75 for M = 50 and M = 100, 
respectively. As discussed in Section |IVl at low spectral efficiency, the energy efficiency increases when 
the spectral efficiency increases. Furthermore, we can see that at high spectral efficiency, ZF outperforms 
MRC. This is due to the fact that the MRC receiver is limited by the intracell interference, which is 
significant at high spectral efficiency. As a consequence, when p^^ is increased, the spectral efficiency of 
MRC approaches a constant value, while the energy efficiency goes to zero (see (|67|)). 

The corresponding optimum values of K and r as functions of the spectral efficiency for M = 100 
are shown in Fig. U\ For MRC, the optimal number of users and uplink pilots are the same (this means 
that the minimal possible length of training sequences are used). For ZF, more of the coherence interval 
is used for training. Generally, at low transmit power and therefore at low spectral efficiency, we spend 
more time on training than on payload data transmission. At high power (high spectral efficiency and low 
energy efficiency), we can serve around 55 users, and K = t for both MRC and ZF. 
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B. Multicell MU-MIMO Systems 

Next, we examine the effect of pilot contamination on the energy and spectral efficiency for multicell 
systems. We consider a system with L = 7 cells. Each cell has the same size as in the single-cell system. 
When shrinking the cell size, one typically also cuts back on the power. Hence, the relation between signal 
and interference power would not be substantially different in systems with smaller cells and in that sense, 
the analysis is largely independent of the actual physical size of the cell ll23l . Note that, setting L = 7 
means that we consider the performance of a given cell with the interference from 6 nearest-neighbor 
cells. We assume Du = Ik, and Dn = (31 k, for i l.To examine the performance in a practical scenario, 
the intercell interference factor, (3, is chosen as follows. We consider two users, the 1st user is located 
uniformly at random in the first cell, and the 2nd user is located uniformly at random in one of the 6 
nearest-neighbor cells of the 1st cell. Let (3i and (32 be the large scale fading from the 1st user and the 
2nd user to the 1st BS, respectively. (The large scale fading is modelled as in Section IV-All ) Then we 
compute /3 as E {/32//3i}. By simulation, we obtain (3 = 0.32, 0.11, and 0.04 for the cases of (cTshadow = 8 

dB, U = 3.8, /reuse = 1), (o"shadow = 8 dB, U = 3, /reuse = 1), and (O"shadow = 8 dB, U = 3.8, /reuse = 3), 

respectively, where /reuse is the frequency reuse factor. 

Fig. [8] shows the relative energy efficiency versus the spectral efficiency for MRC and ZF of the multicell 
system. The reference mode is the same as the one in Fig. [6] for a single-cell system. The dotted line 
shows the performance for the case of M = 1, K = 1, and (3 = 0. The solid and dashed lines show 
the performance for the cases of M = 100, and L = 7, with different intercell interference factors (3 
of 0.32,0.11, and 0.04. Each point on these curves is computed by jointly choosing r, K, and to 
maximize the energy efficiency for a given spectral efficiency. We can see that the pilot contamination 
significantly degrades the system performance. For example, when (3 increases from 0.11 to 0.32 (and 
hence, the pilot contamination increases), with the same power, p^^ = 10 dB, the spectral efficiency and the 
energy efficiency reduce by factors of 3 and 2.7, respectively. However, with low transmit power where 
the spectral efficiency is smaller than 10 bits/s/Hz, the system performance is not affected much by the 
pilot contamination. Furthermore, we can see that in a multicell scenario with high pilot contamination, 
MRC achieves a better performance than ZF. 

VL Conclusion 

Very large MIMO systems offer the opportunity of increasing the spectral efficiency (in terms of 
bits/s/Hz sum-rate in a given cell) by one or two orders of magnitude, and simultaneously improving the 
energy efficiency (in terms of bits/J) by three orders of magnitude. This is possible with simple linear 
processing such as MRC or ZF at the BS, and using channel estimates obtained from uplink pilots even in a 
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high mobility environment where half of the channel coherence interval is used for training. Generally, ZF 
outperforms MRC owing to its ability to cancel intracell interference. However, in multicell environments 
with strong pilot contamination, this advantage tends to diminish. MRC has the additional benefit of 
facilitating a distributed per-antenna implementation of the detector. These conclusions are valid in an 
operating regime where 100 antennas serve about 50 terminals in the same time-frequency resource, each 
terminal having a fading-free throughput of about 1 bpcu, and hence the system offering a sum-throughput 
of about 50 bpcu. 

Appendix 

A. Proof of Proposition |2] 
From (fT6l). we have 



where cji = jj^- Conditioned on Qj^, gi is a Gaussian RV with zero mean and variance /3j which does not 
depend on Qj^. Therefore, gi is Gaussian distributed and independent of p^, gi ~ CA/'(0,/3j). Then, 

\7. |2 



X]i=l,i^fc \9i\ +1 



Using the identity [|2^ 



E{tr(W^-i)} = ^^ (78) 



m 
n — m 

where W ~ Wm {n,In) is an m x m central complex Wishart matrix with n (n > m) degrees of freedom, 
we obtain 

^\^r-u^]= (J ' forM>2 (79) 

Substituting (|79l ) into dTT]) . we arrive at the desired result (flTl) . 

5. Proof of Proposition \3\ 
From Q, we have 



(a) 



where (a) is obtained by using (fTSl) . Using (l80l) . we get (l2l 



Kf3k 

, for M > /s: + 1 (80) 
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TABLE I 

Lower bounds on the achievable rates of the uplink transmission for the fcTH user. 



23 



Perfect CSI 



Imperfect CSI 
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Fig. 1. Lower bounds and numerically evaluated values of the spectral efficiency for different numbers of BS antennas for MRC, ZF, and 
MMSE with perfect and imperfect CSI. In this example there are K — 10 users, the coherence interval T — 196, the transmit power per 
terminal is pu — 10 dB, and the propagation channel parameters were ashadow = 8 dB, and v = 3.8. 
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Fig. 2. Spectral efficiency versus tlie number of BS antennas M for MRC, ZF, and MMSE processing at tlie receiver, witli perfect CSI and 
witlr imperfect CSI (obtained from uplink pilots). In this example K = 10 users are served simultaneously, the reference transmit power is 
Eu = 20 dB, and the propagation parameters were (Tshadow = 8 dB and ly — 3.8. 
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Fig. 3. Same as Figure (2] but with Eu = 5 dB. 
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Fig. 4. Transmit power required to achieve 1 bit/channel use per user for MRC, ZF, and MMSE processing, witli perfect and imperfect CSI, 
as a function of the number M of BS antennas. The number of users is fixed \.o K = 10, and the propagation parameters are (Tshadow = 8 
dB and v = 3.8. 




Fig. 5. Same as Figure |4] but for a target spectral efficiency of 2 bits/channel use per user. 
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Fig. 6. Energy efficiency (normalized with respect to the reference mode) versus spectral efficiency for MRC and ZF receiver processing 
with imperfect CSI. The reference mode corresponds to K = 1,M = 1 (single antenna, single user), and a transmit power of pu = 10 
dB. The coherence interval is T = 196 symbols. For the dashed curves (marked with K = 1), the transmit power pu and the fraction of 
the coherence interval r/T spent on training was optimized in order to maximize the energy efficiency for a fixed spectral efficiency. For 
the green and red curves (marked MRC and ZF; shown for AI — 50 and M — 100 antennas, respectively), the number of users K was 
optimized jointly with pu and r/T to maximize the energy efficiency for given spectral efficiency. Any operating point on the curves can 
be obtained by appropriately selecting pu and optimizing with respect to K and r/T. The number marked next to the x marks on each 
curve is the power pu spent by the transmitter. 
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Fig. 7. Optimal number of users K and number of symbols r spent on training, out of a total of T = 196 symbols per coherence interval, 
for the curves in Fig. |6] corresponding to M = 100 antennas. 
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Fig. 8. Same as Figure (6] but for a multicell scenario, with L = 7 cells, and coherence interval T = 196. 



